Selecting clickstream data mining plans using a case-based reasoning application
نویسنده
چکیده
Despite the increasing interest and wide use of data mining tools, extracting useful knowledge from the growing available data remains a complex task. The Web environment and clickstream data demand even more efficacy to knowledge discovery: such data is a rich, complex and huge source of information; the discoveries are easily actionable; and time constrains are typically hard. One general, crucial and very challenge issue of knowledge discovery is the selection of the right methods to apply according to the nature of the problem under analysis. This issue, particularly within the Web usage mining scope, is the focus of our work. This paper describes a case-based reasoning system especially oriented to assist users in the development and application of Web usage mining processes. This system takes as inputs the characteristics of the available data, the analysis’ requirements, and, based on acquired experience from successfully processes, delivers a solution: a mining plan suited to the data analysis problem at hand.
منابع مشابه
Case Mining from Large Databases
This paper presents an approach of case mining to automatically discover case bases from large datasets in order to improve both the speed and the quality of case based reasoning. Case mining constructs a case base from a large raw dataset with an objective to improve the case-base reasoning systems’ efficiency and quality. Our approach starts from a raw database of objects with class attribute...
متن کاملWeb Usage Mining Using Rough Agglomerative Clustering
Tremendous growth of the web world incorporates application of data mining techniques to the web logs. Data Mining and World Wide Web encompasses an important and active area of research. Web log mining is analysis of web log files with web pages sequences. Web mining is broadly classified as web content mining, web usage mining and web structure mining. Web usage mining is a techniques to disc...
متن کاملA Recommender System Approach for Classifying User Navigation Patterns Using Longest Common Subsequence Algorithm
Prediction of user future movements and intentions based on the users’ clickstream data is a main challenging problem in Web based recommendation systems. Web usage mining based on the users’ clickstream data has become the subject of exhaustive research, as its potential for web based personalized services, predicting user near future intentions, adaptive Web sites and customer profiling is re...
متن کاملIntroducing Softness into Inductive Queries on String Databases
In many application domains (e.g., WWW mining, molecular biology), large string datasets are available and yet under-exploited. The inductive database framework assumes that both such datasets and the various patterns holding within them might be queryable. In this setting, queries which return patterns are called inductive queries and solving them is one of the core research topics for data mi...
متن کاملMining Clinical Data: Selecting Decision Support Algorithm for the MET-AP System
We have developed an algorithm for triaging acute pediatric abdominal pain in the Emergency Department using the discovery-driven approach. This algorithm is embedded into the MET-AP (Mobile Emergency Triage Abdominal Pain) system – a clinical decision support system that assists physicians in making emergency triage decisions. In this paper we describe experimental evaluation of several data m...
متن کامل